System and method for utilizing speech recognition to efficiently perform data indexing procedures

ABSTRACT

A system and method for utilizing speech recognition to efficiently perform data indexing procedures includes an authoring module that coordinates an authoring procedure for creating an index file that has pattern word sets corresponding to data objects stored in a memory of a host electronic device. The pattern word sets are generated with a speech recognition engine that transforms spoken data descriptions into text data descriptions for creating the pattern word sets. The pattern word sets are associated in the index file with data object identifiers that uniquely identify the corresponding data objects. A retrieval module manages a retrieval procedure in which the speech recognition engine converts a spoken data request into a text data request. The retrieval module compares the text data request and the pattern word sets to identify a requested object identifier for locating a requested data object from among the data objects stored in the memory of the host electronic device.

BACKGROUND SECTION

1. Field of Invention

This invention relates generally to electronic speech recognition systems, and relates more particularly to a system and method for utilizing speech recognition to efficiently perform data indexing procedures.

2. Description of the Background Art

Implementing robust and effective techniques for system users to interface with electronic devices is a significant consideration of system designers and manufacturers. Voice-controlled operation of electronic devices may often provide a desirable interface for system users to control and interact with electronic devices. For example, voice-controlled operation of an electronic device may allow a user to perform other tasks simultaneously, or can be advantageous in certain types of operating environments. In addition, hands-free operation of electronic devices may also be desirable for users who have physical limitations or other special requirements.

Hands-free operation of electronic devices may be implemented by various speech-activated electronic devices. Speech-activated electronic devices advantageously allow users to interface with electronic devices in situations where it would be inconvenient or potentially hazardous to utilize a traditional input device. However, effectively implementing such speech recognition systems creates substantial challenges for system designers.

For example, enhanced demands for increased system functionality and performance require more system processing power and require additional hardware resources. An increase in processing or hardware requirements typically results in a corresponding detrimental economic impact due to increased production costs and operational inefficiencies.

Furthermore, enhanced system capability to perform various advanced operations provides additional benefits to a system user, but may also place increased demands on the control and management of various system components. Therefore, for at least the foregoing reasons, implementing a robust and effective method for a system user to interface with electronic devices through speech recognition remains a significant consideration of system designers and manufacturers.

SUMMARY

In accordance with the present invention, a system and method are disclosed for utilizing speech recognition to efficiently perform data indexing procedures. In one embodiment, a system user or other appropriate entity initially instructs an authoring module of an electronic device to enter an authoring mode by utilizing any effective techniques. The authoring module responsively displays thumbnail representations of stored images on an indexing graphical user interface (GUI) of the electronic device. The foregoing thumbnail representations each correspond to a different individual image of stored data from a memory of the electronic device.

A system user or other entity may then choose a selected image from the thumbnail images displayed on the indexing GUI for performing the authoring procedure. The selected image may be chosen using any desired techniques to then become the subject image for creating and storing a corresponding set of one or more pattern words into an index file in the memory of the electronic device. The system user or other entity may then provide a spoken image description corresponding to the foregoing selected image by utilizing any appropriate techniques. For example, the system user may speak the spoken image description into a sound sensor of the electronic device.

A speech recognition engine of the electronic device performs a speech recognition procedure upon the spoken image description to generate a corresponding text image description that the authoring module displays upon an editing section of the indexing GUI. The system user or other entity may then utilize the indexing GUI to edit the displayed text image description to thereby selectively produce one or more pattern words for a corresponding pattern word set. The authoring module then stores the foregoing pattern word set along with an image identifier corresponding to the selected image into the index file. The authoring procedure may be repeated any desired number of times in order to generate additional sets of pattern words to fully populate the index file for subsequently performing one or more image retrieval procedures.

In certain embodiments, a system user or other appropriate entity may utilize a retrieval module of the electronic device to enter an image retrieval mode by utilizing any effective techniques. Next, the system user or other entity provides a spoken image request corresponding to a previously-captured image by utilizing any appropriate techniques. For example, the system user may speak the spoken image request into the sound sensor of the electronic device. The foregoing spoken image request may include any desired verbal description relating to a requested image from the stored images in the memory of the host electronic device.

The speech recognition engine then performs a speech recognition procedure upon the spoken image request to thereby efficiently generate a corresponding text image request. The retrieval module then automatically performs a search procedure upon the foregoing index file that was previously created with the authoring procedure discussed above. The retrieval module may perform the foregoing search procedure by utilizing any effective techniques. For example, in certain embodiments, the retrieval module examines the index file for one or more search matches between the pattern words and the foregoing text image request generated by the speech recognition engine.

If the foregoing search procedure locates one or more matching pattern words in the index file, then the retrieval module utilizes a corresponding image identifier to efficiently locate and display the appropriate image upon a display of the electronic device. The retrieval procedure may be repeated to retrieve and display any desired number of images from the stored data in the electronic device. The present invention therefore provides an improved system and method for efficiently utilizing speech recognition to perform data indexing procedures.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram for one embodiment of an electronic device, in accordance with the present invention;

FIG. 2 is a block diagram for one embodiment of the memory of FIG. 1, in accordance with the present invention;

FIG. 3 is a block diagram for one embodiment of the speech recognition engine of FIG. 2, in accordance with the present invention;

FIG. 4 is a block diagram illustrating functionality of the speech recognition engine of FIG. 3, in accordance with one embodiment of the present invention;

FIG. 5 is a block diagram for one embodiment of the data manager of FIG. 2, in accordance with the present invention;

FIG. 6 is a block diagram for one embodiment of the index file of FIG. 2, in accordance with the present invention;

FIG. 7 is a block diagram of an indexing graphical user interface (GUI), in accordance with one embodiment of the present invention;

FIG. 8 is a flowchart of method steps for performing an authoring procedure, in accordance with one embodiment of the present invention; and

FIG. 9 is a flowchart of method steps for performing a retrieval procedure, in accordance with one embodiment of the present invention.

DETAILED DESCRIPTION

The present invention relates to an improvement in speech recognition systems. The following description is presented to enable one of ordinary skill in the art to make and use the invention, and is provided in the context of a patent application and its requirements. Various modifications to the embodiments disclosed herein will be apparent to those skilled in the art, and the generic principles herein may be applied to other embodiments. Thus, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features described herein.

The present invention comprises a system and method for utilizing speech recognition to efficiently perform data indexing procedures including an authoring module that coordinates an authoring procedure for creating an index file that has pattern word sets corresponding to data objects stored in a memory of a host electronic device. The pattern word sets are generated by utilizing a speech recognition engine that transforms spoken data descriptions into text data descriptions for creating the pattern word sets. The pattern word sets are associated in the index file with data object identifiers that uniquely identify the corresponding data objects. A retrieval module then manages a retrieval procedure in which the speech recognition engine automatically converts a spoken data request into a text data request. The retrieval module compares the text data request and the pattern word sets to identify a requested object identifier for locating a requested data object from among the data objects stored in the memory of the host electronic device.

Referring now to FIG. 1, a block diagram for one embodiment of an electronic device 110 is shown, according to the present invention. The FIG. 1 embodiment includes, but is not limited to, a sound sensor 112, an amplifier 116, an analog-to-digital converter 120, a central processing unit (CPU) 122, a device bus 124, an input/output interface (I/O) 126, a memory 130, and a display 134. In alternate embodiments, electronic device 110 may include elements or functionalities in addition to, or instead of, certain of the elements or functionalities discussed in conjunction with the FIG. 1 embodiment.

In the FIG. 1 embodiment, sound sensor 112 detects sound energy from spoken speech, and then converts the detected sound energy into an analog speech signal that is provided via path 114 to amplifier 116. Amplifier 116 amplifies the received analog speech signal, and provides the amplified analog speech signal to analog-to-digital converter 120 via path 118. Analog-to-digital converter 120 converts the amplified analog speech signal into corresponding digital speech data, and then provides the digital speech data to device bus 124.

CPU 122 then accesses the digital speech data on device bus 124, and responsively analyzes and processes the digital speech data to perform speech recognition procedures according to software instructions contained in memory 130. The operation of CPU 122 and the software instructions in memory 130 are further discussed below in conjunction with FIGS. 2-9. After the speech data has been processed, CPU 122 may then provide the results of the speech recognition procedures to other devices (not shown) via input/output interface 126.

In certain embodiments, electronic device 110 may be implemented as a computer device. In alternate embodiments, the present invention may also be embodied in various other types of electronic devices and systems. For example, the present invention may be implemented as part of an electronic imaging device, a digital still camera device, a cellular telephone, or a personal digital assistant device. In addition, the present invention may be implemented as part of entertainment robots such as AIBO™ and QRIO™ by Sony Corporation.

Referring now to FIG. 2, a block diagram for one embodiment of the FIG. 1 memory 130 is shown, according to the present invention. Memory 130 may comprise any desired storage-device configurations, including, but not limited to, random access memory (RAM), read-only memory (ROM), and storage devices such as floppy discs or hard disc drives. In the FIG. 2 embodiment, memory 130 includes a device application 210, speech recognition engine 214, a data manager 218, an index file 222, and stored data 226. In alternate embodiments, memory 130 may include elements and functionalities in addition to, or instead of, certain of the elements and functionalities discussed in conjunction with the FIG. 2 embodiment.

In the FIG. 2 embodiment, device application 210 includes program instructions that are executed by CPU 122 (FIG. 1) to perform various functions and operations for electronic device 110. The particular nature and functionality of device application 210 typically varies depending upon factors such as the type and particular use of the corresponding electronic device 110.

In the FIG. 2 embodiment, speech recognition engine 214 includes one or more software modules that are executed by CPU 122 to analyze and recognize input sound data. Certain embodiments of speech recognition engine 214 are further discussed below in conjunction with FIGS. 3-4. In the FIG. 2 embodiment, data manager 218 includes one or more software modules and other information for performing various data indexing procedures by utilizing index file 222 and speech recognition engine 214 in accordance with the present invention.

Stored data 226 includes, but is not limited to, various types of image data like individual digital photographs or images captured with a digital still camera or other appropriate means. In various embodiments, the present invention may also be effectively utilized in conjunction with various types of data objects in addition to, or instead of, the foregoing image data. The utilization and functionality of data manager 218 and index file 222 are further discussed below in conjunction with FIGS. 8-9.

Referring now to FIG. 3, a block diagram for one embodiment of the FIG. 2 speech recognition engine 214 is shown, in accordance with the present invention. Speech recognition engine 214 includes, but is not limited to, a feature extractor 310, an endpoint detector 312, a recognizer 314, acoustic models 336, dictionary 340, and language model 344. In alternate embodiments, speech recognition engine 214 may include elements and functionalities in addition to, or instead of, certain of the elements and functionalities discussed in conjunction with the FIG. 3 embodiment.

In the FIG. 3 embodiment, a sound sensor 112 (FIG. 1) provides digital speech data to feature extractor 310 via device bus 124. Feature extractor 310 responsively generates corresponding representative feature vectors that are provided to recognizer 314 via path 320. Feature extractor 310 also provides the speech data to endpoint detector 312, which responsively identifies endpoints of utterances represented by the speech data to indicate the beginning and end of an utterance in time. Endpoint detector 312 then provides the foregoing endpoints to recognizer 314.
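The disclosure does not specify how endpoint detector 312 locates the beginning and end of an utterance. As a minimal sketch only, the following Python example assumes a simple frame-energy threshold, which is one common approach and not necessarily the one used by the described embodiment. The function name `detect_endpoints`, the frame size, and the threshold value are all hypothetical choices made for illustration.

```python
def detect_endpoints(samples, frame_size=4, threshold=0.1):
    """Return (start, end) sample offsets of the active region, or None.

    Assumption: an utterance is any run of frames whose mean squared
    amplitude exceeds `threshold`. Real detectors are more elaborate.
    """
    energies = []
    # Split the signal into non-overlapping frames and compute mean energy.
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        frame = samples[start:start + frame_size]
        energies.append(sum(s * s for s in frame) / frame_size)

    # Frames louder than the threshold are treated as speech.
    active = [i for i, energy in enumerate(energies) if energy > threshold]
    if not active:
        return None

    # Endpoints are the first sample of the first active frame and the
    # sample just past the last active frame.
    return active[0] * frame_size, (active[-1] + 1) * frame_size


# Hypothetical signal: silence, a short burst of "speech", silence.
samples = [0.0] * 8 + [0.5, -0.6, 0.7, -0.5] * 2 + [0.0] * 4
endpoints = detect_endpoints(samples)
```

In this sketch the returned offsets play the role of the endpoints that endpoint detector 312 passes to recognizer 314.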

In the FIG. 3 embodiment, recognizer 314 is configured to recognize words in a vocabulary which is represented in dictionary 340. The foregoing vocabulary in dictionary 340 corresponds to any desired utterances, instructions, narration, or other sounds that are supported for speech recognition by speech recognition engine 214.

In practice, each word from dictionary 340 is associated with a corresponding phone string (string of individual phones) that represents the pronunciation of that word. Acoustic models 336 (such as Hidden Markov Models) for each of the phones are selected and combined to create the foregoing phone strings for accurately representing pronunciations of words in dictionary 340. Recognizer 314 compares input feature vectors from path 320 with the entries (phone strings) from dictionary 340 to determine which word produces the highest recognition score. The word corresponding to the highest recognition score may thus be identified as the recognized word.
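The selection of the word with the highest recognition score can be illustrated with a simplified Python sketch. It assumes that the input has already been decoded into a sequence of phones, and it substitutes a generic sequence-similarity ratio from the standard `difflib` module for the Hidden Markov Model scoring described above. The dictionary contents, phone symbols, and function names are hypothetical.

```python
from difflib import SequenceMatcher

# Hypothetical dictionary: each word maps to its phone string.
DICTIONARY = {
    "dog": ["d", "ao", "g"],
    "beach": ["b", "iy", "ch"],
    "dock": ["d", "aa", "k"],
}


def recognition_score(observed_phones, phone_string):
    """Stand-in score in [0, 1]; a real recognizer would score feature
    vectors against acoustic models rather than compare phone labels."""
    return SequenceMatcher(None, observed_phones, phone_string).ratio()


def recognize_word(observed_phones, dictionary):
    """Return the dictionary word whose phone string scores highest."""
    return max(
        dictionary,
        key=lambda word: recognition_score(observed_phones, dictionary[word]),
    )


exact = recognize_word(["d", "ao", "g"], DICTIONARY)
noisy = recognize_word(["b", "iy", "sh"], DICTIONARY)  # last phone misheard
```

The second call shows that an imperfect phone sequence can still resolve to the closest dictionary entry, which is the purpose of ranking candidates by score.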

Speech recognition engine 214 also utilizes language model 344 to determine specific recognized word sequences that are supported by speech recognition engine 214. Recognized sequences of vocabulary words may then be output as the foregoing word sequences from recognizer 314 via path 332. The operation and implementation of recognizer 314, dictionary 340, and language model 344 are further discussed below in conjunction with FIG. 4.

Referring now to FIG. 4, a block diagram illustrating functionality of the FIG. 3 speech recognition engine 214 is shown, in accordance with one embodiment of the present invention. In alternate embodiments, the present invention may perform speech recognition procedures using techniques and functionalities in addition to, or instead of, certain of the techniques and functionalities discussed in conjunction with the FIG. 4 embodiment.

In the FIG. 4 embodiment, speech recognition engine 214 (FIG. 3) initially receives speech data from a sound sensor 112. A recognizer 314 (FIG. 3) from speech recognition engine 214 compares the input speech data with acoustic models 336 to identify a series of phones (phone strings) that represent the input speech data. Recognizer 314 references dictionary 340 to look up recognized vocabulary words that correspond to the identified phone strings. The recognizer 314 utilizes language model 344 to form the recognized vocabulary words into word sequences, such as sentences or phrases, that are supported by speech recognition engine 214.

In certain embodiments, the foregoing word sequences are advantageously utilized to author an index file 222 (FIG. 2) of pattern words and corresponding image identifiers for locating specific images in stored data 226 (FIG. 2), in accordance with the present invention. The utilization of speech recognition engine 214 to create and utilize index file 222 is further discussed below in conjunction with FIGS. 8-9.

Referring now to FIG. 5, a block diagram of the FIG. 2 data manager 218 is shown, in accordance with one embodiment of the present invention. In the FIG. 5 embodiment, data manager 218 includes an authoring module 514 and a retrieval module 518. In alternate embodiments, data manager 218 may include components or functionalities in addition to, or instead of, certain of the components and functionalities discussed in conjunction with the FIG. 5 embodiment.

In the FIG. 5 embodiment, electronic device 110 utilizes authoring module 514 for performing an authoring procedure to create an index file 222 (FIG. 2). In the FIG. 5 embodiment, electronic device 110 may then utilize retrieval module 518 and the foregoing index file 222 to perform a retrieval procedure to efficiently and effectively locate and display one or more selected images from stored data 226 (FIG. 2) upon display 134 of electronic device 110. The utilization of authoring module 514 is discussed below in conjunction with FIG. 8, and the utilization of retrieval module 518 is discussed below in conjunction with FIG. 9.

Referring now to FIG. 6, a block diagram for one embodiment of the FIG. 2 index file 222 is shown, in accordance with the present invention. In alternate embodiments, index file 222 may include elements and functionalities in addition to, or instead of, certain of the elements and functionalities discussed in conjunction with the FIG. 6 embodiment.

In the FIG. 6 embodiment, index file 222 includes pattern word(s) 1 (612(a)) through pattern word(s) X (612(c)). Index file 222 may be implemented to include any number of different sets of pattern words 612 that may include any desired type of information. In the FIG. 6 embodiment, each set of pattern words 612 from index file 222 typically includes one or more search terms, words, or phrases corresponding to a particular image from stored data 226 (FIG. 2). For example, if the particular image was a photograph of a dog running on a beach, the corresponding set of pattern words 612 might include the words “running dog” and “beach”. Each set of pattern words 612 may include one or more relationship indicators regarding various types of relationships between individual words from the corresponding set as used by retrieval module 518. In addition, each set of pattern words 612 may include one or more priority indicators regarding various types of priorities for words from the corresponding set as used by retrieval module 518.

In the FIG. 6 embodiment, each set of pattern words 612 is associated with a corresponding image identifier 616 that uniquely identifies a particular image in stored data 226 (FIG. 2). For example, pattern word(s) 1 (612(a)) corresponds to image identifier 1 (616(a)), pattern word(s) 2 (612(b)) corresponds to image identifier 2 (616(b)), and pattern word(s) X (612(c)) corresponds to image identifier X (616(c)). In certain alternate embodiments, pattern words 612 may also be associated with corresponding object identifiers that each uniquely correspond to a different data object.
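The pairing of pattern word sets with image identifiers can be sketched as a small data structure. The following Python example is illustrative only: the class names `IndexEntry` and `IndexFile`, the identifier format, and the in-memory list representation are assumptions, since the disclosure does not prescribe a storage format for index file 222.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class IndexEntry:
    """One set of pattern words paired with one image identifier."""
    image_id: str             # hypothetical identifier format
    pattern_words: List[str]  # search terms, words, or phrases


@dataclass
class IndexFile:
    """In-memory stand-in for the index file (storage format assumed)."""
    entries: List[IndexEntry] = field(default_factory=list)

    def add(self, image_id, pattern_words):
        """Associate a new set of pattern words with an image identifier."""
        self.entries.append(IndexEntry(image_id, list(pattern_words)))

    def pattern_words_for(self, image_id):
        """Return every pattern word set stored for the given identifier."""
        return [e.pattern_words for e in self.entries if e.image_id == image_id]


index_file = IndexFile()
index_file.add("IMG_0001", ["running dog", "beach"])
index_file.add("IMG_0002", ["birthday cake"])
```

The relationship indicators and priority indicators mentioned in connection with FIG. 6 are omitted here; they could be modeled as additional fields on each entry.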

In the FIG. 6 embodiment, an authoring module 514 (FIG. 5) coordinates an authoring procedure that utilizes speech recognition engine 214 (FIG. 2) to efficiently create pattern words 612 for index file 222. Once index file 222 has been created, then a retrieval module 518 (FIG. 5) may coordinate a retrieval procedure that utilizes speech recognition engine 214 and index file 222 to locate selected images from stored data 226 for viewing on display 134 of electronic device 110 (FIG. 1). The foregoing authoring procedure is further discussed below in conjunction with FIG. 8, and the foregoing retrieval procedure is further discussed below in conjunction with FIG. 9.

Referring now to FIG. 7, a block diagram for one embodiment of an indexing graphical user interface (GUI) 710 is shown, in accordance with the present invention. In the FIG. 7 embodiment, indexing GUI 710 may include, but is not limited to, an image section 714 and an editing section 718. In alternate embodiments, indexing GUI 710 may include elements and functionalities in addition to, or instead of, certain of the elements and functionalities discussed in conjunction with the FIG. 7 embodiment.

In the FIG. 7 embodiment, authoring module 514 (FIG. 5) advantageously generates indexing GUI 710 for a system user to interactively participate in an authoring procedure for populating index file 222 with pattern words 612 and corresponding image identifiers 616. In the FIG. 7 embodiment, image section 714 may include, but is not limited to, representations of one or more individual images from stored data 226. For example, image section 714 may display any desired number of thumbnail representations (having a relatively small size and a reduced resolution) for images from stored data 226. A system user may then utilize any appropriate means for choosing a selected image displayed on image section 714.

In the FIG. 7 embodiment, indexing GUI 710 also includes editing section 718 for displaying an image description provided by the system user or other appropriate entity. In accordance with certain embodiments of the present invention, speech recognition engine 214 converts a spoken image description provided by the system user into a text image description for display upon editing section 718 of indexing GUI 710. The system user or other appropriate entity may then selectively edit the foregoing text image description on editing section 718 to produce a corresponding set of pattern words 612 for index file 222. The utilization of indexing GUI 710 is further discussed below in conjunction with FIG. 8.

Referring now to FIG. 8, a flowchart of method steps for performing an authoring procedure is shown, in accordance with one embodiment of the present invention. The FIG. 8 flowchart is presented for purposes of illustration, and in alternate embodiments, the present invention may utilize steps and sequences in addition to, or instead of, certain of the steps and sequences discussed in conjunction with the FIG. 8 embodiment.

In the FIG. 8 embodiment, in step 814, a system user or other appropriate entity initially instructs an authoring module 514 of electronic device 110 to enter an authoring mode by utilizing any effective techniques. For example, the system user may use a verbal authoring-mode command that is recognized by a speech recognition engine 214 of electronic device 110 to enter the foregoing authoring mode. In step 818, authoring module 514 responsively displays thumbnail images on an indexing GUI 710 of electronic device 110. The thumbnail images each correspond to a different individual image from stored data 226 in memory 130 of electronic device 110.

In step 822, a system user or other entity chooses a selected image from the foregoing thumbnail images displayed on indexing GUI 710. The selected image may be chosen using any desired techniques to then become the subject image for creating and storing a corresponding set of pattern words 612 into an index file 222 in memory 130 of electronic device 110. In step 826, the system user or other entity provides a spoken image description corresponding to the foregoing selected image by utilizing any appropriate techniques. For example, the system user may speak the spoken image description into sound sensor 112 (FIG. 1) of electronic device 110.

In step 830, speech recognition engine 214 performs a speech recognition procedure upon the spoken image description to generate a corresponding text image description that authoring module 514 then displays upon an editing section of the indexing GUI 710 of electronic device 110. In step 834, the system user or other entity utilizes indexing GUI 710 to edit the displayed text image description to thereby selectively produce one or more pattern word(s) 612. In the FIG. 8 embodiment, the displayed text image description may be edited by deleting, adding, or changing words to produce the corresponding pattern word(s) 612.

Finally, in step 838, authoring module 514 stores the foregoing pattern word(s) 612 along with an image identifier 616 uniquely corresponding to the selected image into index file 222. The FIG. 8 process may then terminate. In certain embodiments, the foregoing FIG. 8 authoring procedure may readily be repeated any desired number of times in order to generate additional sets of pattern words 612 to fully populate index file 222 for subsequently performing one or more image retrieval procedures, as discussed below in conjunction with FIG. 9.
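Steps 830 through 838 of the FIG. 8 authoring procedure can be sketched in Python as follows. This is a hedged illustration under stated assumptions: the speech recognition engine and the user's editing on the indexing GUI are both represented by caller-supplied functions, and the names `author_entry`, `fake_recognize`, and `keep_keywords` are hypothetical stand-ins rather than parts of the described system.

```python
def author_entry(index, image_id, spoken_description, recognize, edit):
    """Create one index entry for a selected image.

    `recognize` stands in for the speech recognition engine and `edit`
    stands in for the user's editing of the displayed text description.
    """
    text_description = recognize(spoken_description)   # step 830
    pattern_words = edit(text_description)             # step 834
    index.append({"image_id": image_id,                # step 838
                  "pattern_words": pattern_words})
    return pattern_words


def fake_recognize(audio):
    """Placeholder recognizer: ignores audio, returns fixed text."""
    return "a picture of my dog running on the beach"


def keep_keywords(text):
    """Placeholder edit: the user deletes everything but three words."""
    wanted = {"dog", "running", "beach"}
    return [word for word in text.split() if word in wanted]


index = []
words = author_entry(index, "IMG_0001", b"<audio bytes>",
                     fake_recognize, keep_keywords)
```

Calling `author_entry` repeatedly with different images corresponds to repeating the authoring procedure to populate the index.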

Referring now to FIG. 9, a flowchart of method steps for performing a retrieval procedure is shown, in accordance with one embodiment of the present invention. The FIG. 9 flowchart is presented for purposes of illustration, and in alternate embodiments, the present invention may utilize steps and sequences in addition to, or instead of, certain of the steps and sequences discussed in conjunction with the FIG. 9 embodiment.

In the FIG. 9 embodiment, in step 918, a system user or other appropriate entity initially instructs a retrieval module 518 of electronic device 110 to enter an image retrieval mode by utilizing any effective techniques. For example, the system user may use a verbal retrieval-mode command that is recognized by a speech recognition engine 214 of electronic device 110 to enter the foregoing image retrieval mode.

In step 922, the system user or other entity provides a spoken image request corresponding to a previously-captured image in stored data 226 by utilizing any appropriate techniques. For example, the system user may speak the spoken image request into sound sensor 112 (FIG. 1) of electronic device 110. In the FIG. 9 embodiment, the foregoing spoken image request may include any desired verbal description relating to a requested image from stored data 226. For example, if the system user wishes to retrieve a particular image of his dog running on a beach, the system user might provide “I want to retrieve a picture of my dog running on the beach” as the foregoing spoken image request.

In step 926, speech recognition engine 214 performs a speech recognition procedure upon the spoken image request to thereby efficiently generate a corresponding text image request. In step 930, retrieval module 518 automatically performs a search procedure upon an index file 222 that has previously been created with an authoring procedure that is the same as or similar to that discussed above in conjunction with FIG. 8. Retrieval module 518 may perform the foregoing search procedure by utilizing any effective techniques. For example, in certain embodiments, retrieval module 518 examines index file 222 for one or more search matches between pattern words 612 and words or phrases from the foregoing text image request generated by speech recognition engine 214 in foregoing step 926.
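One possible form of the step 930 search procedure is sketched below in Python. The sketch assumes that a pattern word or phrase matches when its words appear contiguously, in the same order, in the text image request; the disclosure leaves the matching technique open, so this is only one option. The function names and the index layout are hypothetical.

```python
import re


def normalize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def find_matches(index, text_request):
    """Return (image_id, matched_pattern_words) for each entry that hits."""
    # Pad with spaces so that only whole words or whole phrases match.
    request_text = " " + " ".join(normalize(text_request)) + " "
    results = []
    for entry in index:
        hits = [
            pattern for pattern in entry["pattern_words"]
            if " " + " ".join(normalize(pattern)) + " " in request_text
        ]
        if hits:
            results.append((entry["image_id"], hits))
    return results


index = [
    {"image_id": "IMG_0001", "pattern_words": ["running dog", "beach"]},
    {"image_id": "IMG_0002", "pattern_words": ["birthday cake"]},
]
matches = find_matches(
    index, "I want to retrieve a picture of my dog running on the beach"
)
```

Note that under this strict phrase assumption the pattern "running dog" does not match the request, because the request says "dog running"; the entry is still found through "beach". A more tolerant matcher could compare individual words instead of whole phrases.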

In certain embodiments, retrieval module 518 may prioritize certain words or phrases from the text image request when evaluating search matches between pattern words 612 and the foregoing text image request. For example, word length characteristics may be utilized to prioritize words from the text image request that have a larger number of letters or syllables. In addition, word relationship characteristics may be used to prioritize words from the text image request that fall either at the beginning or at the ending of a corresponding image request sentence.
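The two prioritization characteristics described above can be illustrated with a simple weighting scheme in Python. The specific weights are assumptions made for this sketch: each word's weight is its letter count, and the weight is doubled for the first and last word of the request. The disclosure does not specify numeric weights, and the sketch handles single-word pattern words only.

```python
def word_weights(text_request):
    """Assign a priority weight to each word of the text image request."""
    words = text_request.lower().split()
    last = len(words) - 1
    weights = {}
    for position, word in enumerate(words):
        weight = len(word)              # longer words count more
        if position in (0, last):
            weight *= 2                 # boost sentence-initial/final words
        # Keep the highest weight if a word occurs more than once.
        weights[word] = max(weight, weights.get(word, 0))
    return weights


def score(pattern_words, weights):
    """Sum the weights of the pattern words found in the request."""
    return sum(weights.get(word.lower(), 0) for word in pattern_words)


weights = word_weights("dog running on the beach")
first_score = score(["beach", "dog"], weights)
second_score = score(["running"], weights)
```

Ranking candidate index entries by such a score would let the retrieval module prefer the entry whose pattern words carry the most weight in the request.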

In step 934, retrieval module 518 determines whether one or more matching pattern word(s) 612 have been found in index file 222 during the foregoing search procedure. If no matching pattern word(s) 612 are found during the search procedure, then in step 936, retrieval module 518 may display a notification message to that effect, and the FIG. 9 process may terminate. However, in step 934, if the search procedure locates matching pattern word(s) 612 in index file 222, then in step 940, retrieval module 518 utilizes a corresponding image identifier 616 to efficiently locate and display a corresponding image from stored data 226 upon display 134 of electronic device 110. The FIG. 9 retrieval procedure may be repeated to retrieve and display any desired number of images from stored data 226 in electronic device 110. The present invention therefore provides an improved system and method for efficiently utilizing speech recognition to perform data indexing procedures.
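The branch at step 934 can be sketched in Python as a function that returns either the located image or a notification message. The sketch assumes single-word pattern words, a dictionary that maps image identifiers to stored images, and a result format invented for illustration; none of these details come from the disclosure.

```python
def retrieve_image(index, stored_data, text_request):
    """Return the first matching image, or a not-found notification."""
    request_words = set(text_request.lower().split())
    for entry in index:
        # Step 934: does any pattern word appear in the request?
        if any(word.lower() in request_words
               for word in entry["pattern_words"]):
            # Step 940: use the image identifier to locate the image.
            return {"found": True, "image": stored_data[entry["image_id"]]}
    # Step 936: nothing matched, so report that to the user.
    return {"found": False, "message": "No matching image was found."}


index = [{"image_id": "IMG_0001", "pattern_words": ["dog", "beach"]}]
stored_data = {"IMG_0001": "dog_on_beach.jpg"}  # placeholder for image data

hit = retrieve_image(index, stored_data, "my dog running on the beach")
miss = retrieve_image(index, stored_data, "a snowy mountain")
```

A caller would display `hit["image"]` when a match is found and show `miss["message"]` otherwise.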

The invention has been explained above with reference to certain preferred embodiments. Other embodiments will be apparent to those skilled in the art in light of this disclosure. For example, the present invention may be implemented using certain configurations and techniques other than those described in the embodiments above. Additionally, the present invention may effectively be used in conjunction with systems other than those described above as the preferred embodiments. Therefore, these and other variations upon the foregoing embodiments are intended to be covered by the present invention, which is limited only by the appended claims.

1. A system for indexing electronic information, comprising: anauthoring module that coordinates an authoring procedure for creating anindex file that includes pattern word sets corresponding to data objectsstored in a memory device, said pattern word sets being generated with aspeech recognition engine that transforms spoken data descriptions froma system user into text data descriptions, said pattern word sets beingassociated with data object identifiers that identify said data objects;and a retrieval module that manages a retrieval procedure in which saidspeech recognition engine converts a spoken data request into a textdata request, said retrieval module comparing said text data request andsaid pattern word sets to identify a requested object identifier forlocating a requested data object from among said data objects stored insaid memory device.
 2. The system of claim 1 wherein said pattern word sets each include one or more search terms derived from said text data descriptions, said search terms including one or more key words that relate to said data objects.
 3. The system of claim 1 wherein an indexing graphical user interface is utilized to create said pattern word sets, said indexing graphical user interface including an object section for displaying representations of said data objects, said indexing graphical user interface also including an editing section for converting said text data descriptions into said pattern word sets.
 4. The system of claim 1 wherein said data objects include stored images created by an imaging device, said stored images being saved in said memory device of a host electronic device.
 5. The system of claim 4 wherein said authoring module instructs said host electronic device to enter an authoring mode for performing said authoring procedure in response to a verbal authoring-mode command that is recognized by said speech recognition engine.
 6. The system of claim 4 wherein said authoring module displays thumbnail representations of said stored images on a display of said host electronic device to facilitate said authoring procedure.
 7. The system of claim 4 wherein a selected image is chosen from thumbnail representations of said stored images for creating a corresponding one of said pattern word sets, said selected image being chosen by a system user after viewing said thumbnail representations on an indexing graphical user interface.
 8. The system of claim 4 wherein said spoken data description is provided by a system user speaking into a sound sensor of said host electronic device, said spoken data description including a verbal description of a selected image from said stored images.
 9. The system of claim 4 wherein said retrieval module instructs said host electronic device to enter a retrieval mode for performing said retrieval procedure in response to a verbal retrieval-mode command that is recognized by said speech recognition engine.
 10. The system of claim 4 wherein said spoken data request is provided by a system user speaking into a sound sensor of said host electronic device, said spoken data request including a verbal request related to a selected image from said stored images.
 11. The system of claim 4 wherein said retrieval module displays said requested data object on a display of said host electronic device after said retrieval procedure has been performed.
 12. The system of claim 1 wherein said speech recognition engine performs speech recognition procedures upon said spoken data descriptions to produce said text data descriptions, said speech recognition engine being implemented as a large-vocabulary continuous speech recognizer that includes acoustic models, a dictionary, and a language model.
 13. The system of claim 1 wherein said authoring module displays said text data descriptions on an editing section of an indexing graphical user interface for editing into said pattern word sets.
 14. The system of claim 1 wherein said authoring module edits said text data descriptions on an indexing graphical user interface according to editing instructions from a system user to produce said pattern word sets, said editing instructions including a change-word instruction, a delete-word instruction, and an add-word instruction.
 15. The system of claim 1 wherein said authoring module stores each of said pattern word sets along with a corresponding one of said data object identifiers into said index file for subsequently performing said retrieval procedure.
 16. The system of claim 1 wherein said speech recognition engine performs speech recognition procedures upon said spoken data request to produce said text data request, said speech recognition engine being implemented as a large-vocabulary continuous speech recognizer that includes acoustic models, a dictionary, and a language model.
 17. The system of claim 1 wherein said retrieval module accesses said index file for performing a search procedure that identifies said requested data object by seeking search matches between said text data request and said pattern word sets.
 18. The system of claim 17 wherein said retrieval module performs said search procedure by utilizing pre-defined priority indicators or pre-defined relationship indicators.
 19. The system of claim 18 wherein said pre-defined priority indicators are used to evaluate search terms from said text data request based upon corresponding word lengths.
 20. The system of claim 18 wherein said pre-defined relationship indicators are utilized to evaluate search terms from said text data request depending on whether said search terms are located at the beginning or the end of said text data request.
 21. A method for indexing electronic information, comprising: performing an authoring procedure for creating an index file by utilizing an authoring module, said index file including pattern word sets corresponding to data objects stored in a memory device, said pattern word sets being generated with a speech recognition engine that transforms spoken data descriptions from a system user into text data descriptions, said pattern word sets being associated with data object identifiers that identify said data objects; and utilizing a retrieval module to manage a retrieval procedure in which said speech recognition engine converts a spoken data request into a text data request, said retrieval module comparing said text data request and said pattern word sets to identify a requested object identifier for locating a requested data object from among said data objects stored in said memory device.
 22. The method of claim 21 wherein said pattern word sets each include one or more search terms derived from said text data descriptions, said search terms including one or more key words that relate to said data objects.
 23. The method of claim 21 wherein an indexing graphical user interface is utilized to create said pattern word sets, said indexing graphical user interface including an object section for displaying representations of said data objects, said indexing graphical user interface also including an editing section for converting said text data descriptions into said pattern word sets.
 24. The method of claim 21 wherein said data objects include stored images created by an imaging device, said stored images being saved in said memory device of a host electronic device.
 25. The method of claim 24 wherein said authoring module instructs said host electronic device to enter an authoring mode for performing said authoring procedure in response to a verbal authoring-mode command that is recognized by said speech recognition engine.
 26. The method of claim 24 wherein said authoring module displays thumbnail representations of said stored images on a display of said host electronic device to facilitate said authoring procedure.
 27. The method of claim 24 wherein a selected image is chosen from thumbnail representations of said stored images for creating a corresponding one of said pattern word sets, said selected image being chosen by a system user after viewing said thumbnail representations on an indexing graphical user interface.
 28. The method of claim 24 wherein said spoken data description is provided by a system user speaking into a sound sensor of said host electronic device, said spoken data description including a verbal description of a selected image from said stored images.
 29. The method of claim 24 wherein said retrieval module instructs said host electronic device to enter a retrieval mode for performing said retrieval procedure in response to a verbal retrieval-mode command that is recognized by said speech recognition engine.
 30. The method of claim 24 wherein said spoken data request is provided by a system user speaking into a sound sensor of said host electronic device, said spoken data request including a verbal request related to a selected image from said stored images.
 31. The method of claim 24 wherein said retrieval module displays said requested data object on a display of said host electronic device after said retrieval procedure has been performed.
 32. The method of claim 21 wherein said speech recognition engine performs speech recognition procedures upon said spoken data descriptions to produce said text data descriptions, said speech recognition engine being implemented as a large-vocabulary continuous speech recognizer that includes acoustic models, a dictionary, and a language model.
 33. The method of claim 21 wherein said authoring module displays said text data descriptions on an editing section of an indexing graphical user interface for editing into said pattern word sets.
 34. The method of claim 21 wherein said authoring module edits said text data descriptions on an indexing graphical user interface according to editing instructions from a system user to produce said pattern word sets, said editing instructions including a change-word instruction, a delete-word instruction, and an add-word instruction.
 35. The method of claim 21 wherein said authoring module stores each of said pattern word sets along with a corresponding one of said data object identifiers into said index file for subsequently performing said retrieval procedure.
 36. The method of claim 21 wherein said speech recognition engine performs speech recognition procedures upon said spoken data request to produce said text data request, said speech recognition engine being implemented as a large-vocabulary continuous speech recognizer that includes acoustic models, a dictionary, and a language model.
 37. The method of claim 21 wherein said retrieval module accesses said index file for performing a search procedure that identifies said requested data object by seeking search matches between said text data request and said pattern word sets.
 38. The method of claim 37 wherein said retrieval module performs said search procedure by utilizing pre-defined priority indicators and pre-defined relationship indicators.
 39. The method of claim 38 wherein said pre-defined priority indicators are used to evaluate search terms from said text data request based upon corresponding word lengths.
 40. The method of claim 38 wherein said pre-defined relationship indicators are utilized to evaluate search terms from said text data request depending on whether said search terms are located at the beginning or the end of said text data request.
 41. The method of claim 37 wherein said retrieval module performs said search procedure by utilizing pre-defined priority indicators to evaluate search terms from said text data request.
 42. The method of claim 37 wherein said retrieval module performs said search procedure by utilizing pre-defined relationship indicators to evaluate search terms from said text data request.
 43. A computer-readable medium comprising program instructions for indexing electronic information by: performing an authoring procedure for creating an index file by utilizing an authoring module, said index file including pattern word sets corresponding to data objects stored in a memory device, said pattern word sets being generated with a speech recognition engine that transforms spoken data descriptions from a system user into text data descriptions for creating said pattern word sets, said pattern word sets being associated with data object identifiers that identify said data objects; and utilizing a retrieval module to manage a retrieval procedure in which said speech recognition engine converts a spoken data request into a text data request, said retrieval module comparing said text data request and said pattern word sets to identify a requested object identifier for locating a requested data object from among said data objects stored in said memory device.
 44. A system for cataloguing electronic information, comprising: means for performing an authoring procedure for creating an index file that includes pattern word sets corresponding to data objects stored in a memory device, said pattern word sets being generated with a speech recognition engine that transforms spoken data descriptions from a system user into text data descriptions for creating said pattern word sets, said pattern word sets being associated with data object identifiers that identify said data objects; and means for managing a retrieval procedure in which said speech recognition engine converts a spoken data request into a text data request, said means for managing then comparing said text data request and said pattern word sets to identify a requested object identifier for locating a requested data object from among said data objects stored in said memory device.